Noise Robust Speaker Identification Using Sub-Band Weighting in Multi-Band Approach

نویسندگان

  • Sungtak Kim
  • Mikyong Ji
  • Youngjoo Suh
  • Hoirin Kim
چکیده

Recently, many techniques have been proposed to improve speaker identification in noise environments. Among these techniques, we consider the feature recombination technique for the multi-band approach in noise robust speaker identification. The conventional feature recombination technique is very effective in the band-limited noise condition, but in broad-band noise condition, the conventional feature recombination technique does not provide notable performance improvement compared with the full-band system. Even though the speech is corrupted by the broad-band noise, the degree of the noise corruption on each sub-band is different from each other. In the conventional feature recombination for speaker identification, all sub-band features are used to compute multiband likelihood score, but this likelihood computation does not use a merit of multi-band approach effectively, even though the sub-band features are extracted independently. Here we propose a new technique of sub-band likelihood computation with sub-band weighting in the feature recombination method. The signal to noise ratio (SNR) is used to compute the subband weights. The proposed sub-band-weighted likelihood computation makes a speaker identification system more robust to noise. Experimental results show that the average error reduction rate (ERR) in various noise environments is more than 24% compared with the conventional feature recombination-based speaker identification system. key words: feature recombination, multi-band approach, speaker identification, sub-band likelihood, sub-band weighting

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Maximum likelihood sub-band adaptation for robust speech recognition

Noise-robust speech recognition has become an important area of research in recent years. In current speech recognition systems, the Mel-frequency cepstrum coefficients (MFCCs) are used as recognition features. When the speech signal is corrupted by narrow-band noise, the entire MFCC feature vector gets corrupted and it is not possible to exploit the frequency-selective property of the noise si...

متن کامل

A multi-band approach based on the probabilistic union model and frequency-filtering features for robust speech recognition

Multi-band approach has recently been introduced for recognition of speech corrupted by frequency-localized noise, showing higher robustness than the traditional full-band approach. However, the multi-band approach has been found to be less robust for wide-band noise than the full-band approach. In this paper, we present a multi-band recognition system based on the combination of the probabilis...

متن کامل

Multi-scale Fractal Dimension for Speaker Identification System

This paper looks at the extraction of nonlinear turbulence information of the speech signal using Multi-Scale Fractal Dimension (MFD) as the additional feature to the convectional MFCC features. The MFD is estimated using both Box-Counting and Minkowiski-Bouligand dimensions. The proposed framework uses this feature extraction together with sub-band based speaker modeling rather than the wide-b...

متن کامل

HMM Sub-band Based Speaker Identification Using Linear and Non-linear Merging Techniques

Speaker identification is a powerful, non-invasive and inexpensive biometric technique. The recognition accuracy however deteriorates when noise levels only affect a specific band of frequency. In this paper we present a sub-band based speaker identification that intends to improve the live-testing performance. Each frequency sub-band is processed and classified independently. We also compare t...

متن کامل

Improved model training and automatic weight adjustment for multi-SNR multi-band speaker identification system

In our previous paper, we presented a speaker identification system using a multi-SNR multi-band method, and reported its robustness against environmental noises. This paper describes two modifications to the system for further enhancement of its noise-robustness. Firstly, 1/f noise is employed instead of white Gaussian noise to make noisy data for training multi-SNR GMMs. Secondly, recombinati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEICE Transactions

دوره 90-D  شماره 

صفحات  -

تاریخ انتشار 2007